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We develop several algorithms for performing quantum phase estimation based on basic measure- 
ments and classical post-processing. We present a pedagogical review of quantum phase estimation 
and simulate the algorithm to numerically determine its scaling in circuit depth and width. We 
show that the use of purely random measurements requires a number of measurements that is op- 
timal up to constant factors, albeit at the cost of exponential classical post-processing; the method 
can also be used to improve classical signal processing. We then develop a quantum algorithm 
for phase estimation that yields an asymptotic improvement in runtime, coming within a factor of 
log* of the minimum number of measurements required while still requiring only minimal classical 
post-processing. The corresponding quantum circuit requires asymptotically lower depth and width 
(number of qubits) than quantum phase estimation. 

PACS numbers: 03.67.Lx, 03.65.Fd 

Keywords: quantum phase estimation, inference 



I. INTRODUCTION 

Quantum algorithms promise computational speed-ups 
over their classical counterparts. Quantum phase esti- 
mation is a key technique used in quantum algorithms, 
including algorithms for quantum chemistry ^ [5] and 
quantum field theory [3 , Shor's algorithm for prime 
factorization [4], and algorithms for quantum sampling 
[SI [BJ. It can be used to find eigenvalues of a unitary 
matrix efficiently 

There are two main approaches to quantum phase esti- 
mation: (1) invoking an inverse Quantum Fourier Trans- 
form (QFT) [7HS] to extract information about the phase 
or (2) performing a basic measurement operation fol- 
lowed by classical post-processing in place of the QFT 
[TQl [TT] . An advantage to approach (2) is that it uses 
classical post-processing in place of quantum operations, 
trading off an expensive resource for an inexpensive clas- 
sical computation. In particular, the QFT requires many 
small controlled-rotations, each of which must be approx- 
imated to precision e by a sequence of basic quantum op- 
erations of length 0(log(l/e)) [12]. In practice, we may 
want to significantly reduce the circuit depth of the phase 
estimation algorithm in exchange for a small increase in 
circuit width, i.e., the number of qubits. Therefore, we 
focus on approach (2) and rely primarily on quantum 
measurements to infer information about the phase. 

We begin by outlining the goal of quantum phase esti- 
mation and explaining the basic measurement operation 
that is used as a subroutine to do this, and contrast this 
problem with the classical Fourier transform. We then 
describe various phase estimation algorithms; these al- 
gorithms all call the same basic measurement operation, 
but use different parameters to do this. 

We first present in Section |HI| a technique based on 



random measurements to infer the phase; this technique 
uses the fewest number of measurements of any we know 
(and we prove that it is within a constant factor of opti- 
mal), but it requires impractical classical post-processing 
for use in, say, Shor's algorithm [3], with a complexity 
that is exponential in the number of bits being inferred. 
However, this technique may be practical in certain clas- 
sical noisy signal processing and inference applications, 
where the number of bits being inferred is smaller. We 
explain these applications in this section and give some 
extensions of the technique that may be useful in infer- 
ring very noisy, sparse signals. 



In Section [IV[ we review a quantum phase estimation 
algorithm based on the same measurement operation, 
but the measurements are not random and the classical 
post-processing can be done efficiently [TUl E] . We simu- 
late this algorithm and determine its complexity, circuit 
depth, and circuit width for various sizes of input. 

In Section [Vj we improve upon this phase estima- 
tion algorithm by considering inference across multiple 
qubits. We show that this technique requires asymptoti- 
cally fewer measurements, and in turn has a correspond- 
ingly (asymptotically) smaller circuit width and depth, 
while still allowing efficient classical post-processing. 

We compare the circuit constructions for Kitaev's 
phase estimation algorithm and the fast phase estima- 
tion algorithm in Section |VI[ Three models of computa- 
tion are discussed: the first is a sequential model with 
limited parallelism, the second is a highly parallel model, 
and the third is a model based on a cluster of quantum 
computers. 



II. PHASE ESTIMATION AND THE BASIC 
MEASUREMENT OPERATION 

We begin by reviewing the goal of quantum phase es- 
timation and the basic measurement operator, following 
the algorithm of Kitaev [TO] (see Ref. [TTJ for complete 
details). We derive the steps slightly differently, in antic- 
ipation of our extension in the later sections. 

Assume that we have a unitary operator U and we 
would like to estimate the eigenvalues A& of U given U 
and the eigenvectors |£fc): 



&1&> = A fc |&) 



(1) 



where the eigenvalues take the form A& = e 2m ' (pk . The 
phase ifk is a real number modulo 1 , which can be repre- 
sented as a unit-length circle: <pk = j mod 1, ipk € R/Z, 
< k < t < 2 m (while it may seem more natural at first 
to instead consider numbers that range between and 
2-7T, rather than choosing a number between and 1 and 
multiplying by 27r as we do here, we choose the latter be- 
cause it will be more natural when later considering an 
expansion of tfk as a binary fraction). By measuring the 
eigenvalues of U, we can obtain an estimate of the phase 
(fk ; this process is called quantum phase estimation, rj 

The goal of all phase estimation algorithms is to take 
a state of the form |£/-) and determine the corresponding 
eigenvalue A&. The measurement operation described be- 
low commutes with U , so we can apply it multiple times 
to the same state with different parameters to improve 
our knowledge of the eigenvalue. There are two param- 
eters in the measurement result: (1) the precision 5 and 
(2) the probability of error e. That is, we obtain some 
estimate a of if), where, with probability at least 1 — e, 
\ct — <Pk\ mod i < S, where mod 1 is the distance on the 
unit circle. If ipk is chosen from a discrete set of angles 
| with fixed t and unknown k, our goal is to make the 
precision smaller than S = ^ so that we can determine 



2/ 



ifk exactly from this set. In Section III wc slightly sim- 
plify the problem by directly inferring the angle <fk from 
the discrete set of possible angles, without bothering to 
introduce a real number precision; in this case e is the 
probability of error in our discrete inference. 



A. Basic Measurement Operation 

We begin by constructing a measurement operator 
such that the conditional probability depends on ipk, that 
is, upon measuring this operator, we learn some informa- 
tion about (pk . This construction relies on the fact that if 



1 In the context of Shor's algorithm, the corresponding eigenvector 
is defined as |£ fe > = -^ ^1=0 e' 2 ^"^ \a n ). 
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FIG. 1: Circuit to perform the measurement operator. 



|£fc) is an eigenvector of U, then it is also an eigenvector 
of powers M of U: 



U M |&> = Af fa) 



(2) 



The operator takes as input two quantum registers: 
one initialized to |0) and the other initialized to the eigen- 
vector |£fc). The operator depends upon two parameters, 
a "multiple" M and an "angle" 9, where M is an integer 
between 1 and 4—1 (to make it practical to implement, 
we restrict to positive integers M ) and 6 is a real number 
between and 27r. 

The measurement operator used to measure the eigen- 
values is as follows: 



H M ,fl(C0 



E 



j I e 2-7TiM-ip k +i6 j _ JZmM-(pk+i6 
I _ e 2iriM-cp k +i6 i _|_ e 2wiM-<p h +i6 

1 + U M exp(i6) 1 - U M exp(i8 
1 - U M exp(ifl) 1 + U M exp(i6» 



(3) 



which acts on the quantum states by the following trans- 
formation: 



|o) 1&) Sm ^ {u) 

^ I e 2iriM-ip k +iO 



(4) 



|0) 



-i _ 2iriM-ip k +i6 



1) )®\&). 



The corresponding circuit is shown in Fig. [Tl The gate 
Z(0) corresponds to the unitary: 



Z{6) = 



1 
e w 



(5) 



It follows that the measurement outcome probabilities 
are given by: 



PM,e(0\k) 
and 

PM.eiMk) 



1 



a 2wiM-ipk+iO 



-i _ 2TviM-ip k +i6 



1 + cos(2ttM -ipk + 9) 
2 : 

(6) 



1 - cos(2ttM -cpk + 0) 



(7) 

We write these probabilities as conditional probabilities 
to emphasize that they depend upon the unknown k. 



B. Relation To Classical Fourier Transform and 
Generalizations 



ability of getting a given sequence of measurement out- 
comes is 



With Eqs. (6J7| in hand, we can see that if we apply 
a large number of measurements using the same M at 
both 9 = and 9 = w/2, we can accurately estimate 
cos(2-7rM ■ ipk) and sin(27rM • tp k ). Using a sufficiently 
accurate estimate of these cosines and sines at two dif- 
ferent values of M allows us to determine (ft- accurately. 
This is the problem of reconstructing a sparse signal (in 
this case, composed of a single Fourier mode) from its 
value at a small number of different "times" (i.e., differ- 
ent values of M). However, the accurate determination of 
cos(27rM • ifk) would require a very large number of mea- 
surements, polynomial in t, while other methods require 
many fewer measurements. The reason is that the large 
number of measurements at a fixed value of M means 
that each measurement imparts little additional infor- 
mation. By varying M, we are able to obtain accurate 
results from a much smaller number of measurements. 

This relates to a problem of reconstructing the Fourier 
transform of a signal from very noisy measurements. The 
quantum phase estimation problem involves a signal with 
a single Fourier mode. However, this gives rise to a nat- 
ural generalization of reconstructing a problem with a 
small number of Fourier modes from very noisy measure- 
ments. We consider this problem at the end of the next 
section. 



III. "INFORMATION THEORY" PHASE 
ESTIMATION 

One procedure for estimating the phase (or angle) is to 
perform a series of random measurements and then solve 
a hard classical reconstruction problem. We measure the 
operator at a set of randomly chosen multiples Mj and 
angles 9i and classically reconstruct the angle 2n(pk- In 
this section, we show that we can determine <fk with only 
0(\og(t)) measurements; we also show that this result is 
tight. 

We randomly select Mj for each measurement i be- 
tween 1 and t — 1, and also assume a small randomized 
offset noise 9i = 2irr, where r is a random double. The 
conditional measurement probabilities for this measure- 
ment operator on the i th measurement are given by: 

P,(0|*) = ! + <™(2*y •¥* + »<), (8) 



and 

P I (l|fc)= 1 ~ COS(2 ^- yfc + ^ ) =l-P i (0|fc). 0) 



Let Vi be the outcome of the i th measurement. Since 
different measurements are independent events, the prob- 



P(v 1 ,...,v s \k)=Y[P i (v i \k). 



(10) 



Assuming a flat a priori distribution of k, the probabil- 
ity distribution of fc given the measurement sequence is 
proportional to P(v\, ■ ■ ■ , v s \k). The algorithm then to 
compute k given a sequence of s measurements is simple: 
for each k compute the probability P(«i, . . . ,v s \k), out- 
putting the k which maximizes this. The post-processing 
time required is of order si, which is exponentially large 
in the number of bits inferred since the value of k that it 
outputs can be written with |"log 2 (£)] bits. 

The information theory phase estimation algorithm is 
given in Algorithm [ll 

Algorithm 1 Information Theory Phase Estimation 

1: for i — 1 to s do 

2: Choose random Mi. Choose random 9i. 

3: Perform basic measurement operation with multiple 

Mi and angle 6i. 
4: end for 
5: Maximize 

s 

P(vi,...,v.\k) = 1 [[P t (vi\k) 

over all choices of k. 
6: return k/t, the estimate of the phase. 

To illustrate, we simulated the probability of inferring 
the given angle 2irtfk among t = 10 equally distributed 
possible angles. Figures 2a -2e plot the inferred proba- 



bility distribution as a function of angle after s measure- 
ments, where s = {10, 20, 30, 40, 50}. The black diamond 
on each plot indicates the peak at the correct angle. From 
the plots, we see that after 10-20 measurements, the in- 
ference is very noisy, while after 40 or more measurements 
it has inferred some information about the correct angle, 
and after 50 measurements it is very precise. 

In Figure [3j we plot simulation results for inferring 
the given angle 2irtp k among t = {10 1 , 10 2 , 10 3 , 10 4 , 10 5 } 
equally distributed possible angles. The x-axis is the 
number of random measurements s and the y-axis is the 



probability that the k which maximizes Eq. ( 10 1 after 
s measurements is the correct angle. Clearly, as t in- 
creases, the number of measurements increases, following 
an 0(log(£)) behavior. 

A. Bounds on s 

We now show that 0(log(t)) measurements suffice to 
estimate the angle with high probability. This number of 
measurements required is asymptotically optimal (up to 
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(a) 10 random measurements. 
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(b) 20 random measurements. 



(c) 30 random measurements. 
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(d) 40 random measurements. 



(e) 50 random measurements. 



FIG. 2: Results of simulating the probability of inferring a given angle among t = 10000 equally distributed possible 
angles. Plots are of the inferred probability distribution as a function of angle after 10-50 random measurements. 

The correct angle is marked by a black diamond. 



constant factors), as clearly |_log 2 (*)J measurements are 
required to have an error probability greater than 1/2: 
after s measurements, there are at most 2 s possible out- 
comes for the sequence of measurements, so to select an 
angle from a set of t choices with probability greater than 



1/2, we need 2 s > t/2. A more sophisticated entropic ar- 
gument would likely be able to improve the constant in 
front of this lower bound. 

The next theorem implies that the number of measure- 
ments to obtain error probability at most e is log 1 / c (t/e) 
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FIG. 3: Results of simulating "information theory" phase estimation for t = {10 1 , 10 2 , 10 3 , 10 4 , 10 5 } equally 
distributed possible angles. The x-axis is the number of random measurements s; the y-axis is the probability that 

the most likely angle after s measurements is the correct angle. 



for some constant c < 1. 

Theorem 1. Suppose we choose the multiples Mi and 
angles 9i at random, as above. Suppose the measurement 
outcomes are chosen with probabilities given in Eqs. pjw 
for k = ko- Then, the probability e that the algorithm de- 
scribed above chooses a k' ^ fco a $ the choice with maxi- 
mal likelihood is bounded by 



now over all choices of 6% , Mj . This equals 



tc s 



(11) 



for some numerical constant c strictly less than 1 (c does 
not depend upon t). 

Proof. We first consider a given k' ^ k and estimate the 
probability that after s measurements, the probability 
P(vi, . . . , v s \k') — Y[t=i Pi(vi\k') is greater than or equal 
to P{v\, ...v s \ko). Consider the expectation value 



E 



rP(v 1 ,...,v s \k') 1/2 



p(v u ... : v s \k yn 



(12) 



where the expectation value is over measurement out- 
comes and choices of Mi and 6i . This equals 



E 



{Mifti} 






p( Vl ,...,v s \kr /2 

p( Vl ,...,v s \koy/ 2 



P(vi ...,v s \k ) 



E imM [E P ^ ' ■ ■ ' VsW) 1/2 P(vi, • ■ • , ^ ) 1/2 



where the sum is over all 2 s possible sequences v±, ...,v s 
of measurement outcomes and the expectation value is 



(EmjI^PmAvW^PmA^o) 1 



/2 



(13) 



where Em $[■■■] is the expectation value over M,6. A 
direct calculation shows that for all k' ^ k, the term 
in parenthesis E M ,e[J2v PM,e{v\k') 1/2 PM,e(v\k ) 1/2 } is 
bounded by some constant c < 1 for all t. Thus, 
the expectation value (12) is bounded by c s . Thus, 
for a given k' , the probability that P{v\, . . . , v s \k') > 
P{v\, . . . ,v s \k ) is bounded by c s , as can be shown by 

applying Markov's inequality to p[ll[Z'^ s \t ]i/2 - 

Thus, the probability that there is a k! such that 
P{v\, . . . , v s \k') > P(vi, . . . ,v s \k ) is bounded by tc s . □ 

We have not bothered to optimize the estimate in 
the above theorem: it is possible that a tighter bound 
could be considered by estimating the expectation value 

E[( pfa 1 '""'" 3 fc { ) ] for some constant < a < 1 and 
optimizing the choice of a in the spirit of the Chernoff 
bound. 

Finally, we remark that while we have selected 9 ran- 
domly between and 2tt in the above algorithm and in 
the above theorem, in fact it would suffice to pick 6 ran- 
domly from the set of angles {0, 7r/2}, or indeed from any 
set of a pair of angles that do not differ by exactly 7r (for 
example, the set {0, it} would not work). The proof of the 
theorem would be essentially the same in this case, and 
restricting to such a smaller set of angles may be more 
convenient for implementation on a quantum computer. 



B. Classical Inference of Multiple Fourier Modes 

The results above suggest a natural generalization of 
the problem. Define a classical channel E(x) which maps 
from a real number x between —1 and 1 to an output 
consisting of a single bit. We fix the output probabilities 
of this channel: 



P(0\x) 



P(l\x) 



1 + x 



1-x 

2 



(14) 



(15) 



Then Eqs. (8J9) can be interpreted as follows: for 6i = 
0, for any Mi, we take the number cos(27rMi • iff.) and 
input this number into the channel and the output of the 
channel is the measurement outcome, while for 6i = 1, 
we instead input sin(27rMi • tpk)- 

This then suggests a natural generalization. Consider 
a classical signal written as a sum of Fourier modes: 



f(M) = Y, o(fc) exp(2«M • tp k ). 



(16) 



Here, M is an integer and the function is periodic with 
period t. 

Then, we have the natural classical problem: 

Problem 1. Assume that f(M) is K-sparse, meaning 
that at most K of the coefficients a(k) are non-zero. As- 
sume that the non-zero a(k) are chosen from a discrete 
set S of possible values (typically we will be interested in 
\S\ being small), with tnhx a ^b,aeS,bes\a - b\ > d min for 
some dmin. The a(fc) may be complex. 

Let A max be the maximum of |/(M)| over all such K- 
spar.se a(k) and over all M. 

Assume that we have some channel C(x) which maps 
from a real number in the range [— A max , A max ] to an out- 
put chosen from a discrete set (the channel C(x) need not 
be the same as that given above in Eqs. (1J\15 )). For this 
channel to be useful in inferring x from measurements of 
the output, we will require that different input numbers 
lead to different output probabilities, and we will quantify 
this more precisely below in Eq. (_?7[ ). 

Pick several different Mi, and for each Mi measure 
C(Re(/(Mi))) or C(Im(/(M,))). Infer the coefficients 
a{k). 

This problem can be interpreted as inferring a classical 
sparse signal from noisy measurements at several differ- 
ent "times" (interpreting each Mi as a time at which 
to infer the signal). We now show, given suitable as- 
sumptions on C(x), that this problem can be solved us- 
ing a number of measurements that is 0(log(N c } W i ces )), 
where N cno i ces is the number of possible choices of K- 
sparse f(M). As before, this number of measurements 



is asymptotically optimal. Note that for K « t, 
log{N chozces )^K\og(t\S\). 

The procedure we describe is similar to that previously: 
we select random Mi and randomly choose whether to 
measure C(Re(/(M;))) or C(Im(/(M,))) at each time. 
After s measurements, we select the choice of a(k) which 
has the maximal a posteriori likelihood, assuming a flat 
initial distribution. Interestingly, since the number of 
measurements we need is asymptotically much smaller 
than y/i (indeed we only need 0(log(t)) measurements 
if K = 0(1)), this means that this random procedure 
typically does not ever pick M, = Mj for i ^ j. That 
is, interpreting the Mj as "times" , this means that we do 
not ever measure the signal twice at the same time. 

Note that we have assumed that the non-zero coef- 
ficients are chosen from a small set S of possible val- 
ues. As the number of possible values of S increases, 
the number of measurements increases for two reasons. 
First N cn oices increases. Second, the values in the set 
become more closely spaced [d m in becomes smaller com- 
pared to A max ), and the measurement outcomes proba- 
bilities hence become less sensitive to the particular value 
of a(k). This second problem is actually the more serious 
one. Suppose that we have a signal that is 1-sparse, and 
we even know that the only non-zero a& is at k = 0. The 
question is to infer the magnitude of ao- Every measure- 
ment then consists of sending ao into the channel C(x). 
Using the channel C(x) before, it takes 1/e 2 measure- 
ments to infer a to precision e. This number of measure- 
ments is exponential in the number of bits of precision in 
ao. That is, it takes many more measurements to infer 
the amplitude of a Fourier coefficient than it does to infer 
its frequency. 

Theorem 2. Suppose we choose the multiples Mi at ran- 
dom as above and randomly choose whether to measure 
C(Re(/(Mi))) or C(Im(/(Mj))) at each time. Suppose 
also that C(x) has the probability that for any x, y £ 



-^max7 -^m,ax\ 



we have 



PivlxY^Pivly^Kl-colx-yl 2 (17) 



for some constant cq, where the probabilities P(v\x) are 
the probability that the channel C gives output v given in- 
put x. Then, the probability e that the algorithm described 
above chooses akl ^ k$ as the choice with maximal like- 
lihood is bounded by 



where 



c< 1 



^ choice sC 



/d \ 2 d 2 
I "mm \ a min 

C °\ 2 ) 16A* 



(18) 



(19) 



Proof. Assume the correct choice of a(k) is given by 
ao(fc). We consider a given sequence a'(k) (such 



that for at least one k, a'(k) ^ ao(k)) and estimate 
the probability that after s measurements, the prob- 
ability P(vi, . . . ,v s \a'(k)) is greater than or equal to 
P(vi, ...v s \a Q (k)), where v±, . . . ,v s are the measurement 
outcomes of the channel. 
Let 



Hence, by the assumption (17) on C(x), we have that the 



term in parenthesis in Eq. ( 23 1 is bounded by 



c<l-c {— - 



i 2 ■ 

III 1 1) 



16AI 



(24) 



n 



/o(M) = J2 a o(k) exp(2mM ■ (p k ) 



and 



f(M) = Y^ a '0) exp{2iriM ■ 



Vk)- 



Consider the expectation value 



E 



P( Vl ,. 



,,v, 



\a'(k)) 



1/2 



P( Vl ,...,v s \a (k)y/ 2 



(20) 



(21) 



(22) 



IV. 



KITAEV'S PHASE ESTIMATION 
ALGORITHM 



where the expectation value is over measurement out- 
comes and choices of Mi and choices of real or imaginary 
part. This equals 



HI 



P(v 1 ,...,v s \a'(k)) 1 / 2 



■,v.\ao(k)) 1/2 
,v.\a'(k))V 2 P(vi 



P(vi ...,v a \ao(k)) 



Recall from Section |IIB| that if we apply a large num- 
ber of measurements using two different values of M at 
both 9 = and 9 = n/2, we can accurately estimate 
cos(27rM • <fk) and sin(27rM ■ (fik), and therefore deter- 
mine ipk- In this section, we review Kitaev's phase esti- 
mation algorithm to determine <pk with exponential pre- 
cision [10) (for complete details, we refer the reader to 
Sec. 13.5 in Ref. [TTj) . This algorithm relies on obtain- 
ing accurate measurements at multiples of ifk. We begin 
by reviewing how to accurately measure a given multiple 
of ifk with constant precision, building up to estimating 
the phase with exponential precision. We also simulate 
the algorithm to determine how many measurements are 
required in practice. 



|ao(fc)) 



1/2 



where the sum is over all possible sequences Vi,...,v s of 
measurement outcomes and the expectation value is now 
over all choices of 9 t and of real or imaginary part (Ri = 
0, 1 is used to denote a measurement of real or imaginary 
part). This equals 



A. Estimating ft with Constant Precision 



Recall that ifk 



mod 1, where fk G R/Z and 






M=0 



P(v\Re(fo(M))y/ 2 P(v\Re(f(M))y/ 2 
2 



< k < t < 2 m . Let Oi = {0,tt/2} at ran dom. Us- 
ing the measurement operator given in Section |IV A| and 
Eqs. (6|7|, the conditional probability when measuring 
multiple M = 1 is given by: 



P(v\Im(f (M))) l / 2 P(v\lm(f(M))) 1 /' 



P(Q\k) = 



1 + C0S(27T • Ifik + 9i) 



(25) 



} S .(23) 



Below, we will use the assumptions on C(x) to show 
that the term in parenthesis in Eq. p3| is bounded by 
some constant c < 1 for all t. Using this bound, the 
expectation value (22 1 is bounded by c s . Thus, for 



given a'(fc), the probability that P(vi, . . . ,v s \a'(k)) > 
P{v\, . . . , v s \ao(k)) is bounded by c s . Thus, the probabil- 
ity that there is an a'(k) such that P(vi, . . . , v s \a'(k)) > 
P(v\, . . . , v s \ao(k)) is bounded by N c hoicesC s , as claimed. 



We now bound the term in parenthesis in Eq. ( 23 1 . 
Consider \ J2 M \I'{M) - f Q (M)\ 2 . This is greater than 
for d 2 mm . Also, for every M, \f(M) - f (M)\ 2 < 
AA 2 nax . So, for randomly chosen M, the probability that 
\f(M) - /o(M)| 2 is greater than or equal to d 2 mm /2 is 
at least d^^jSA^^. So, the probability that if we ran- 
domly choose M and randomly choose whether to mea- 
sure real or imaginary part, that the corresponding part 
(i.e., either real or imaginary) of f'(M) — fo(M) is greater 
than d m i n /2 in absolute value is at least d\ lin j'\&A\ iax . 



We now solve for the conditional probability P(0\k): 
2P(0\k) - 1 (26) 

= C0S(27T • (fik + 9i) 

= cos(27r • ifk) cos 9i — sin(27r • ip^) sin^. 

We make s measurements, choosing 9i G {0, 7r/2} ran- 
domly, to obtain approximations P* os and P* in close to 
cos(27r-iy9fc) and sm(2TT-(pk), respectively. Let there be N c 
measurements with 9 L = 0. Let iV c (0) denote the number 
of these measurements having outcome and let N c (l) 
denote the number having outcome 1. Then, let 

N c ( 0) - JV C (1) 
N„ 



COS 



(27) 



If there are N s measurements with 9i = 7r/2, with N s (0) 
of them having outcome and N s (l) having outcome 1, 
then let 



p* 



N„{l)-N a {0) 



(28) 



Given P* os , P* in , our best estimate of ipk is obtained by 
taking an arctangent of P s * in /P c * os , choosing the appropri- 
ate quadrant. 

Equivalently, we can determine multiples Mi of tpk in 
the same manner by measuring and obtaining the prob- 
ability 

, , , 1 + cos(27rM; ■ ifiu + 6A , s 

P(0|fc) = v -^— ^ l -±, (29) 

and computing similar estimates P* and again taking an 
arctangent. 

In practice, how many measurements s are needed to 
accurately determine Mi ■ ip^l This is analyzed in the 
next two sections. 



B. Estimating ipk with Exponential Precision 

To efficiently achieve exponential precision in our es- 
timate of ifk, we measure multiples Mi of ipk- Then we 
use the measurement results in a classical inference tech- 
nique to enhance the precision of the estimate. We begin 
by measuring multiple M = 2 m ~\ then M x = 2 m ~ 2 , 
increasing the precision as we move to M m _i = 2°. Each 
measurement gives us an estimate of Mitpk mod 1. 

To achieve the desired precision and probability of er- 
ror, we measure each multiple s times, where in this sec- 
tion, s refers to the number of measurements per multi- 
ple for both cosine and sine, so that the total number of 
measurements required is 2ms. The estimate of 2^~ 1 • ip^, 



using methods of Sec. IV A is denoted as pj. 

We introduce binary fraction notation, where 
.a\. .. otj — J2p=i2~ p atp, ot p £ {0,1}. The output of 
the algorithm is a — .a\ . . . a m+ 2i which is an exponen- 
tially precise estimate of ip^: 



\a- <fk\ < 



1 



2>n+2 ' 



(30) 



Kitaev's phase estimation algorithm [TU] is given in Al- 
gorithm [2] 

Algorithm 2 Kitaev's Phase Estimation [TO] 

1: for j — m — 1 to 1 do 

2: Set pj to the estimate of 2 J_ • ipy, using O(s) mea- 

surements per j. 
3: end for 
4: Set .a m Q. m +\Ot m +2 = fim, where /3 m is the octant value 

{§, !,..., |} closest to p m - 
5: for j — m — 1 to 1 do 
6: Infer a,: 



if \.0aj + iaj+2 - pj\ mo d 1 < 1/4. 

1 if |.lay+iay+2 — pj\ mod 1 < 1/4. 



7: end for 

8: return a, the estimate of the phase. 



Note that in Algorithm [2j we modify the inference step 
in line 6] to use pj , as opposed to using f3j as done in 

Rfif.nr 



C. Simulation Results 

How large does s need to be to estimate ipk to expo- 
nential precision? The probability that a given estimate 
of 2i~ 1 tpk differs by more than a given amount from the 
true value is exponentially small in s, as shown in Ref . 1111 
using a Chernoff bound. This implies that to accurately 
compute the word (the entire sequence of bits a) , we need 
s to scale logarithmically with to. 

We ran 10000 independent simulations of Algorithm 
[2] for words of length to = {1000, 10000}. These word 
lengths are of particular interest since Shor's algorithm 
promises computational speed-ups over its classical coun- 
terpart for word lengths around 2048-4096. We consid- 
ered performance of the algorithm as we varied the num- 
ber of measurements s of each multiple. Figure HI shows 
the numerical results. The x-axis is the number of mea- 
surements s. The y-axis is the probability, where we plot 
both the probability of a given bit being wrong (blue) 
across all bits and simulation runs, and the probability 
of a given word being wrong (red) across all simulation 
runs (i.e., the probability that at least one bit in the word 
is wrong). For both word lengths, we see that s scales 
logarithmically in to, and does not exceed 64. The num- 
ber of measurements required thus scales as 0(m log(m)). 
The corresponding classical post-processing circuit scales 
as 0(m) size and 0(log(m)) depth. 



V. FAST PHASE ESTIMATION 

In this section, we extend Kitaev's algorithm for phase 
estimation by considering inference across multiple bits 
simultaneously. We begin by describing an algorithm 
that improves the number of measurements 0(TO,log(m)) 
in the previous section to 0(TOlog(log(TO,)). This algo- 
rithm consists of two "rounds" , where the first round is 
similar to Kitaev's algorithm, but the second round infers 
multiple bits simultaneously. Having described this algo- 
rithm, we then describe how to further improve it by con- 
sidering more rounds, requiring 0(m log(log(log(m)))) 
measurements for three rounds, and so on, ultimately 
describing an algorithm that requires 0(m log* (m)) mea- 
surements, where log* (to) is the iterated logarithm and 
is bounded for all practical purposes by 5. These algo- 
rithms all require only an amount of computational time 
for classical post-processing that is 0(?Tilog(TO)) as dis- 
cussed at the end of the section. 

The algorithms in this section can be motivated as fol- 
lows: the limitation of Kitaev's algorithm is that it infers 
single bits at a time, and requires logarithmically many 
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FIG. 4: The number of measurements s versus the probability of a given bit being wrong (blue) and a given word 

being wrong (red), meaning probability that a given word has at least one bit wrong, (a) Simulation results for 

words of length 1000 bits, (b) Simulation results for words of length 10000 bits. 



measurements per bit. So, a natural generalization is to 
consider multiples M that are not powers of two, so that 
we can infer multiple bits at a time. The information the- 
ory method does this, by using random M, but requires 
lots of post-processing. So, in this section we consider 
"sparse" M, in that the M will be a sum of a small num- 
ber of powers of two. There is a tradeoff, in that as the 
"density" (defined to be the number of powers of two) in- 
creases, the number of measurements required is reduced, 
but the postprocessing becomes more complicated. So to 
make the inference efficient, we use a bootstrapping pro- 
cedure with multiple rounds, with the density increasing 
from one round to the next. The early rounds yield only 
imperfect inferences, but they give enough information 
to simplify the inference in later rounds. 



dicate that this is the precision on the first round. This 
quantity <5i will be much larger than the final precision 
6 of our two-round algorithm, but will be much smaller 
than 1. We say "almost every" j because, as we will see, 
we will only be able to give this precise estimate Pj for 

< j < m — log(l/(5i); however, since log(<5i) will be 
much smaller than m, this will indeed be most of the j. 
To compute pj, we use /3j+z for I = 0, ...,log(l/c5i) in a 
Kitaev-style inference procedure to compute log(l/(5i)+2 
bits in the binary expansion of pj. That is, we obtain 
the three lowest order bits in the binary expansion from 
/3j+iog(i/<5i)- We then sharpen the estimate, obtaining the 

1 bit in the binary expansion from /3 J+ ;_i and from the 
I + 1 and I + 2 th bits, proceeding iteratively. We can 
bound the probability of error in pj by 



A. Two Round Algorithm 

The measurements that we use in the first round of the 
two-round algorithm are equivalent to those of Kitaev's 
algorithm, except that the parameter s will be chosen 
differently. We set s — s± for some si chosen later (we 
call this quantity si as in the second round we will have 
an S2 , and so on) . Using a Chernoff bound estimate as in 
Ref. QTJ we can bound the probability that the difference 
between 2 J_1 • ipk and our best estimate of 2 J_1 ■ tp k is 
greater than 1/16 by exp(— cs\) for some constant c > 0. 
For notational simplicity, we will use one piece of notation 
that was used in Kitaev's original algorithm: for each j, 
we will let Pj be the closest approximation in the set 
{2, .... 1} to the estimate of 2 : ' _1 • ip-^. So, the probability 
of an error larger than 1/8 in f3j is bounded by exp(— cs±). 

In the original Kitaev algorithm, we then combine 
these j3j to estimate tpk. Instead, our goal in the first 
round of the two-round algorithm is to give, for almost 
every j, a quantity called pj that will be an estimate of 
2 J ~ 1 • ipk to a precision S±, where the subscript 1 is to in- 



Pr Pj-V -ip k >Si < log(l/<5 1 )exp(-csi). 

mod 1 

(31) 
The factor of log(<5i) occurs because to obtain an error 
less than Si requires log(l/<5i) bits of precision. This esti- 
mate of the probability of error is essentially the same as 
the estimate of the probability of having an error in Ki- 
taev's original algorithm, except that instead of having m 
bits in the expansion, we have \og(l/6\) bits. The event 
of having large error for some given j is uncorrelated 
with the event of having large error for bits j' if \j' — j\ 
is large enough compared to log(l/<5i). This will play an 
important role in analyzing the algorithm later, allowing 
us to neglect certain correlations (we will explain this 
below, although we will not give a mathematical proof 
of this). The fact that we have only obtained the accu- 
rate estimate of pj for j < in — log(l/<5i) will not pose a 
difficulty in what follows; this will be only a minor tech- 
nical detail. For one thing, most "sets of measurements" 
(as defined below) do not "contain" (also defined below) 
the j for which we do not have an accurate estimate. 
Alternatively, we can simply on the first round infer all 
Pj for j < m accurately by running the first round on 
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to + log(l/5i) bits. 

The second round uses S2TO "sets" of measurements, 
for some parameter s 2 chosen later, where each set of 
measurements will consist of repeating the same mea- 
surement a total of C times, for some constant C. We 
also introduce a parameter S, called the "density" in this 
round. For the i th set of measurements, we pick S differ- 
ent random values of j in the range 1 < j < m, calling 
these values j\, —■,]%■ We will require that these val- 
ues j\,---,j l s all be distinct from each other in a given 
measurement (if any two are equal, we simply generate 
another 5-tuplc of values; we will have S « \fm so a 
random tuple will have distinct entries with probability 
close to 1). Then we estimate 



Algorithm 3 Fast Phase Estimation 



2*" 



2-72- 



2Js- 



fk. 



(32) 



calling this estimate <7j. We do this estimate using C 
applications of the basic measurement operation, with 
Mi = 2 J i~ 1 + ... + 2 J s~ 1 for each measurement and 9i 
being chosen randomly in {0, 7r/2}. The constant C will 
be chosen so that 



First Round: 

for j — m — 1 to 1 do 

Estimate V~ x ■ ipt using O(l) measurements per j. 
end for 
Later Rounds: 
for r = 2 to Number of Rounds do 

Set density, S, and number of measurements per bit, 
s r , for given round. 
for i = 1 to s r m do 

Set Mi to a sum of S different powers of two, 
choosing these powers of two at random or with a 
pseudo-random distribution. Perform O(l) mea- 
surements with given Mi and random or pseudo- 
random 6. 
end for 
end for 

Perform multi-bit inference to determine estimate of /3'j = 
2 J ~ 1 ■ <fk for all j. Use estimates from previous round to 
give starting point for inference in next round. See text 
for details. 

Set .a m a m +ict m +2 = fi' m ; 
for j — m — 1 to 1 do 



Infer a, 



if |.0o!j+iaj+a 

1 if |.lay + iay+2 






mod 1 < 1/4. 
mod 1 < 1/4. 



16: end for 

17: return a, the estimate of the phase. 



Pr 



M t ■ ip k 



mod 1 



> 1/32 



< 



(33) 



The constant C is of order unity and does not depend 
upon to. 

This completes the description of the measurements in 
the two-round algorithm. We now describe the classical 
post-processing phase. We will explain below how to esti- 
mating a quantity /3' for each j. This quantity will be an 
approximation to 2 : >~ 1 ipk, chosen from the set {8, ..., ?}. 
The goal of the algorithm is to obtain an estimate such 
that for all j we have 



Pr 



2 J - 



<Pk - & 



mod 1 



> 1/16 



< 



(34) 



for some constant e. Thus, by a union bound, the prob- 
ability of an error greater than 1/8 in any of the /?'• will 
be bounded by e. We then use the /3j to determine the 
ctj using a procedure similar to Kitaev's algorithm. This 
procedure is given in Algorithm [31 steps 13 — 17. 



If the error is bounded by 1/8 for all /?', then the esti- 
mate of the phase will be accurate to 2~ l - 2n+2 \ 

To estimate 2- J_1 • ipf. for a given j, consider all sets 
of measurements such that one that one of the random 
values of j a was equal to j; we say that such a set of 
measurements "contains j" . On average, there will be 
S2<S' such sets of measurements. Let us first proceed by 
assuming that there are indeed exactly S2S sets of mea- 
surements and then later deal with the fluctuations in 
the number of sets of measurements. On the i th set of 
measurements, we obtain some estimate of Oi. Suppose 
this set contains j. Without loss of generality, let us sup- 
pose that ji = j. Then, given only Ui and pj 2 , ..., pj s , 
our best estimate of 2 J_1 • ip^ is: 



T i Pj2 Pj3 



Pjs 



(35) 



We now bound the probability that the estimate is 
off by more than 1/16. We do this by bounding the 
probability that our value of Ui differs by more than 1/32 
from the correct value using Eq. ( 33 1 and also bounding 



the probability that our estimate of Xw=2 Pit differs by 
more than 1/32 from the correct value. To bound that 
probability, we have 



Pr 



<[!(£ 



P.ii 



-2 31 



■ : r 



1=2 



1 



> 

mod 1 o2 



< 51og(32S")exp(-csi), 
(36) 
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where we have taken §i = 1/32S in Eq. (31) so that if 
each quantity pj x — 2 J, ~ 1 ■ ip is accurate to within <5i then 
the sum is accurate to within 1/32. We then use a union 
bound: if the probability that any given measurement is 
inaccurate is bounded by log(l/<5i) exp(— csi), then the 
probability that at least one measurement is inaccurate 
is bounded by S times that quantity. 

We choose s\ ~ log(log(m)) and S ~ log(m) so that 



Eq. (35) differs by more than 1/16 from 2-? l ■ ipk- For a 



the right-hand side of Eq. (36) is bounded by 1/32. 
Then, using Eqs. (33|36[), the probability that the quan- 



tity in Eq. (35) differs by at least 1/16 from 2 J ■ ip^ is 



bounded by 1/4. We get roughly s 2 S different estimates 
of 2- 7-1 • ifk, one for each set of measurements involving 
the given j. Let us assume the independence of certain 
events between different sets of measurements, namely 



the event that the quantity in Eq. ( 35 ) differs by more 
than 1/32 from 2 J_1 • ip^ (we discuss this further below). 
Then, we can combine these measurements to obtain an 
estimate of /?'• by picking the value of /?'■ which is most 

frequently within 1/16 of X);=2 ( Pji ~~ 2 j!_1 ■ ip I ; i.e., it is 
within 1/16 of that value for the greatest number of sets 
of measurements containing j. 

The probability of error in /3' by more than 1/16 is 
then bounded by exp(— c's2S) for some constant c' > 0. 
Picking s 2 ~ lj we find that the probability of error is 
l/poly(m) for any desired polynomial, with the power 
depending upon the ratio between S/log(m), so we can 
ensure that this probability is small compared to e/m. 
The number of measurements required by this procedure 
is 0(m log(log(m))). 

We now discuss several issues of correlations and fluc- 
tuations that were left open in the above analysis. First, 
consider the fluctuation in the number of sets of mea- 
surements that contain j, for any given j. On average 
this quantity is S2S, but there may be some fluctuations. 
However, the probability that there are fewer than s%S/2 
different such sets of measurements is exponentially small 
in S21S, and hence for the given choice of S2S, the quan- 
tity is bounded by l/poly(m) and so can be made negli- 
gible (in fact, this probability, being exponentially small 
in S2S, has a similar scaling as the probability that we 
incorrectly infer a given 2- 7-1 • ip^ given S2S sets of that 
contain j, as that probability is also exponentially small 
in S2S). So, with high probability all j are contained in 
at least S2S/2 measurements, and so we can double S 
and apply the analysis above. 

It is possible that a better way to deal with fluctuations 
in the number of measurements is to change the distri- 
bution of choices of j*, and anti-correlate the choices in 
different sets of measurements to reduce the fluctuations 
in the number of sets of measurements containing a given 
j. This will at best lead to a constant factor improve- 
ment. 

Another kind of correlation that we must deal with 
is correlation between the events that the quantity in 



given j, let us assume for a given set of measurements we 
have j\ — j. Let us refer to j\, ■■■,j l s as the "partners" 
of j. For a given j, the different sets of measurements 
involving that j will typically have wildly different part- 
ners of j; that is, for two different sets of measurements, 
m, n, we will typically have \j™ — j"| > m/S 2 » log(S) 
for a,b 7^ 1. So, for most sets of measurements, these 
will be independent. Similarly, in a given measurement 
we will typically have \j" 1 — f b n \ >> log(5) so we can 
ignore correlations between errors in different pj a . 

Of course, the above is not a rigorous proof, but we 
expect that such a proof can be provided without any 
significant difficulty. Note that if for a given j we have a 
large number of (roughly) independent sets of measure- 
ments containing that j, then adding a small number of 
correlated sets of measurements will not prevent the in- 
ference from working. 



B. Multiple Round Algorithm 

We can improve this procedure by increasing the num- 
ber of rounds. In the first and second rounds we proceed 
as before, though the constants Si, S2, S will be changed. 
Let us write S = S2 for the second round. The third 
round of the procedure is the same as the second round, 
except that we do S3 sets of measurements, and in each 
measurement we pick S3 different random values of j. On 
the third round, as in the second, we repeat each set of 
measurements C times; it is only the first round where 
the quantity C does not appear, for the reason that in 
that round, each measurement is already being repeated 
S\ times. We can increase the number of rounds indefi- 
nitely. In each round, we can exponentially increase the 
density compared to the previous round, while keeping 
all constants s a of order unity. The number of measure- 
ments required is then proportional to the number of 
rounds. Since S increases exponentially in each round 
and we need S ~ log(m) in the last round, the number 
of rounds required is ~ log* (m) and the total number of 
measurements is ~ m log* (m) . 



C. Classical Post-processing Time Required 

The simplest implementation of the algorithm above 
requires a time 0(mlog 2 (m)). We discuss this first and 
then discuss how to improve to 0(mlog(m)). Each bit 
is contained in ~ log(m) sets of measurements (indeed, 
the fact that it is contained in this many sets of mea- 
surements is the whole point of the algorithm). To com- 
pute the quantity in Eq. ( 35 ) , the sum on the right-hand 



side require summing over S different quantities, and for 
S ~ log(m), this means that it takes time ~ log(m) to do 
the computation for each bit for each set of measurements 
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containing that bit. So, with m bits, each contained in 

log(m) sets of measurements, the time is (3(m log (to)). 

However, we can slightly improve this by noting that 



Eq. ( 35 ) can be written as 



Ph + Ph 



Pjs 



)+Pn- 



The quantity in parentheses can be computed once for 
each set of measurements, and re-used in inferring each 
of the pj t for i G {1,...,S'}, and then it only requires 
0(1) time to do the arithmetic for each of these i. This 
improves the total time to 0(mlog(m)). 



VI. ANALYSIS OF QUANTUM CIRCUIT 
DEPTH AND WIDTH 



The fast phase estimation algorithm offers an asymp- 
totic improvement in the number of measurements re- 
quired to estimate the phase with exponential precision. 
How does the corresponding quantum circuit scale, in 
terms of depth, width and size? We define the depth 
of a quantum circuit as the number of timesteps, where 
gates on disjoint qubits can occur in parallel in a given 
timestep. Here we assume that a given n-qubit gate takes 
one timestep. The width of a quantum circuit is the num- 
ber of qubits. The size of a quantum circuit is the to- 
tal number of non-identity quantum gates. We analyze 
the circuits given three different computing settings, to 
emphasize tradeoffs in depth and width depending on re- 
source availability. Table IT] contains a summary of the 
circuit resources required for each algorithm given the 
setting. 

First, consider the setting where each measurement op- 
erator is performed sequentially. That is, the circuit is 
given by the sequence of gates (shown in Fig. [5]) 

H® ms Z{9 Ml )^{U Kh )[qi,A] Z{e M ,)K\U M ^[q2,A] . . . 
Z{e Mms )k l {U M ™){q ms ,A] H® ms , (38) 

where A n {U)[qi,q2\ denotes n-qubits in register qi con- 
trolling the application of gate U to register q 2 . The 
quantum register containing the eigenvector state is de- 
noted by \A) and consists say of a qubits. Each phase 
estimation algorithm performs 0(ms) measurements, re- 
sulting in a circuit of depth and size 0(ms). The cir- 
cuit requires 0(ms) ancilla qubits, one per measurement, 
plus a additional qubits. For Kitaev's phase estima- 
tion and fast phase estimation, s equals 0(log(m)) and 
O (log* (to)), respectively. Thus in the sequential setting, 
fast phase estimation offers an asymptotic improvement 
in circuit depth and size, as well as in the number of 
ancilla qubits. 

Second, consider a more parallel setting obtained by 
decreasing circuit depth at the cost of increasing circuit 
width. We can parallelize quantum phase estimation us- 
ing techniques presented in Refs.[TTIand(T3l The idea is 
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FIG. 5: Quantum circuit for sequential phase 
estimation. 
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FIG. 6: Quantum circuit for parallel phase estimation. 
Each wire represents a register of qubits. 



to apply one multi-controlled gate instead of the sequence 



in Eq. ( 38 1 , by evolving as 

\M) ® \A) -> \M) ® U M \A) 



where M is given by the sum of the multiples: 

ms 



(39) 



(40) 



The sum can be computed using a quantum addition 
circuit based on a 3-2 quantum adder (also called a carry- 
save adder) [HI [TH [15] , which reduces the sum of three 
TO-bit numbers to a sum of two encoded numbers in O(l) 
depth, with width 0(m) and size 0{m). Consider M to 
be a sum of s TO-bit integers. The circuit first uses a 
log(s)-depth tree of 3-2 adders to produce two encoded 
numbers, and then adds these two numbers in place us- 
ing a quantum carry-lookahead adder [16 with O(to) an- 
cillae, 0(m) size, and 0(\og(m)) depth. In total, the 
addition requires a quantum circuit of 0(ms) ancillae, 
0(log(s) +log(m)) depth, and 0(ms) size. 

The circuit for performing parallel phase estimation is 
shown in Figure [6] The circuit begins with a quantum 
register containing qubits initialized to |0). Each qubit 
qi undergoes a Hadamard operation, followed by a phase 
rotation by angle 9i about the z-axis. An addition circuit 
is applied to determine \M). A controlled U M operation 
is applied, followed by an addition circuit to uncompute 
\M). Finally, 0(ms) Hadamard operations and measure- 
ments are applied, which can be done in depth 0(1). The 
complete circuit for parallel phase estimation requires 
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TABLE I: Table of circuit depth, width, and size for Kitaev's quantum phase estimation and fast phase estimation. 



Type 


Kitaev's Phase Estimation |llj 


Fast Phase Estimation 


Depth 


Width 


Size 


Depth 


Width 


Size 


Sequential 


O(mlog(m)) 


0(mlog(m)) 


0(mlog(m)) 


0(m log* (m)) 


0(ralog*(m)) 


0(mlog*(m)) 


Parallel 


0(log(m)) 


0(mlog(m)) 


0(mlog(m)) 


0(log(m)) 


0(m log* (m)) 


0(m log* (m)) 


Cluster 


O(log(m)) 


0(771*) 


0(mlog(m)) 


0(log*(m)) 


0(771*) 


0(mlog*(m)) 
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VII. CONCLUSIONS AND FUTURE WORK 



Node m We have presented several algorithms for quantum 

phase estimation based on a basic measurement opera- 
tion and classical post-processing. Both our "information 
theory" algorithm and our fast phase estimation algo- 
rithm depend upon a randomized construction of which 
measurements to take, and have applications to classical 
signal processing and quantum phase estimation. Our 
fast phase estimation algorithm achieves asymptotic im- 
provements in circuit depth, width, and size over Kitaev's 

Node 1 phase estimation, resulting in significant reductions in re- 

source requirements including circuit depth and size, and 
the number of ancilla qubits. Remarkably, when using an 
m-node cluster of quantum computers, our algorithm re- 
quires essentially constant time. It is an interesting ques- 
tion for future work to de-randomize these algorithms. 



FIG. 7: Quantum circuit for parallel phase estimation 
across a cluster consisting of m nodes. 



0{ms) size and 0(log(s) +log(m)) depth, up to the im- 
plementation of the multi-controlled U M gate. Again, 
s equals 0(log(m)) for Kitaev's phase estimation and 
0(log*(m)) for fast phase estimation yielding a signifi- 
cant reduction in circuit size and width to 0(mlog*(m)). 

Third, consider access to a cluster of quantum com- 
puters containing m nodes. Each node performs s mea- 
surements, resulting in a depth of O(s), with a size and 
width per node of O(s) gates and 0(s + a) qubits, re- 
spectively. The cumulative cost across all m nodes is 
O(s) depth, 0{ms) size, and 0(ms + ma) qubits. Again, 
fast phase estimation yields asymptotic improvements in 
all dimensions, and results, for all practical purposes, in 
a constant-depth phase estimation circuit. One poten- 
tial advantage of the cluster model is that errors do not 
accumulate on the eigenvector state \A), since subsets of 
measurements are done on separate nodes. This could 
be advantageous when designing a fault-tolerant phase 
estimation algorithm. 

Table [I] summarizes the circuit size, depth, and width 
for the various settings of the two algorithms. Fast phase 
estimation yields asymptotic improvements in each di- 
mension. 
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